Perceptual audio coding using adaptive pre- and post-filters and lossless compression
نویسندگان
چکیده
This paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of source signals (including both music and speech) with different sampling rates. It is based on separating irrelevance and redundancy reductions into independent functional units. This contrasts traditional audio coding where both are integrated within the same subband decomposition. The separation allows for the independent optimization of the irrelevance and redundancy reduction units. For both reductions, we rely on adaptive filtering and predictive coding as much as possible to minimize the delay. A psycho-acoustically controlled adaptive linear filter is used for the irrelevance reduction, and the redundancy reduction is carried out by a predictive lossless coding scheme, which is termed weighted cascaded least mean squared (WCLMS) method. Experiments are carried out on a database of moderate size which contains mono-signals of different sampling rates and varying nature (music, speech, or mixed). They show that the proposed WCLMS lossless coder outperforms other competing lossless coders in terms of compression ratios and delay, as applied to the pre-filtered signal. Moreover, a subjective listening test of the combined pre-filter/lossless coder and a state-of-the-art perceptual audio coder (PAC) shows that the new method achieves a comparable compression ratio and audio quality with a lower delay.
منابع مشابه
Lossless and Perceptual Coding of Digital Audio
We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially better compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple coding can reveal orig...
متن کاملImproved Forward-Adaptive Prediction for MPEG-4 Audio Lossless Coding
MPEG-4 Audio Lossless Coding (ALS) is a new addition to the suite of MPEG-4 audio coding standards. The ALS codec is based on forward-adaptive linear prediction, which offers remarkable compression even with low predictor orders. Nevertheless, performance can be significantly improved by using higher predictor orders, more efficient quantization and encoding of the predictor coefficients, and a...
متن کاملInteger Wavelet Transform Based Lossless Audio Compression
In this paper we propose the use of integer wavelet [2] as a decorrelation stage for adaptive context based lossless audio coding. The original wideband audio signal is first decomposed in wavelet subbands. The resulted coefficients are integer valued and therefore can be transmitted using an adaptive context based method, in a lossless manner, the decoder being able to reconstruct them and aft...
متن کاملMpeg4 Als – the Standard for Lossless Audio Coding
The MPEG-4 Audio Lossless Coding (ALS) standard belongs to the family MPEG-4 audio coding standards. In contrast to lossy codecs such as AAC, which merely strive to preserve the subjective audio quality, lossless coding preserves every single bit of the original audio data. The ALS core codec is based on forward-adaptive linear prediction, which combines remarkable compression with low complexi...
متن کاملInteger wavelet transforms based lossless audio compression
In this paper we propose the use of integer wavelet [2] as a decorrelation stage for adaptive context based lossless audio coding. The original wideband audio signal is first decomposed in wavelet subbands. The resulted coefficients are integer valued and therefore can be transmitted using an adaptive context based method, in a lossless manner, the decoder being able to reconstruct them and aft...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Speech and Audio Processing
دوره 10 شماره
صفحات -
تاریخ انتشار 2002